The Impact of Multifunctional Genes on "Guilt by Association" Analysis

نویسندگان

  • Jesse Gillis
  • Paul Pavlidis
چکیده

Many previous studies have shown that by using variants of "guilt-by-association", gene function predictions can be made with very high statistical confidence. In these studies, it is assumed that the "associations" in the data (e.g., protein interaction partners) of a gene are necessary in establishing "guilt". In this paper we show that multifunctionality, rather than association, is a primary driver of gene function prediction. We first show that knowledge of the degree of multifunctionality alone can produce astonishingly strong performance when used as a predictor of gene function. We then demonstrate how multifunctionality is encoded in gene interaction data (such as protein interactions and coexpression networks) and how this can feed forward into gene function prediction algorithms. We find that high-quality gene function predictions can be made using data that possesses no information on which gene interacts with which. By examining a wide range of networks from mouse, human and yeast, as well as multiple prediction methods and evaluation metrics, we provide evidence that this problem is pervasive and does not reflect the failings of any particular algorithm or data type. We propose computational controls that can be used to provide more meaningful control when estimating gene function prediction performance. We suggest that this source of bias due to multifunctionality is important to control for, with widespread implications for the interpretation of genomics studies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Effectiveness of Cognitive-Behavioral Group Therapy on Guilt Feeling Among Family Caregivers of Patients With Alzheimer’s Disease

Objective: This study aimed at investigating the effectiveness of cognitive-behavioral group therapy on guilt feeling among family caregivers of patients with Alzheimer’s disease. Methods: This research was a quasi-experimental study where in the pre-test and post-test control group design was employed. The statistical population of this study consisted of the female family caregivers of...

متن کامل

The Effectiveness of Self-Compassion Education on the Shame and Guilt of Mothers of Children with Learning Disorders

Background & objectives: Parents of children with learning disorders experience more negative emotions than parents of normal children, therefore, they require attention and receiving psychological interventions.  This study made an effort to reduce the psychological problems of these mothers. The aim of this study was to investigate the effectiveness of self-compassion training on the shame an...

متن کامل

Metagenomic Guilt by Association: An Operonic Perspective

Next-generation sequencing projects continue to drive a vast accumulation of metagenomic sequence data. Given the growth rate of this data, automated approaches to functional annotation are indispensable and a cornerstone heuristic of many computational protocols is the concept of guilt by association. The guilt by association paradigm has been heavily exploited by genomic context methods that ...

متن کامل

Effectiveness of Cognitive Analytic Therapy on Reducing Guilt Fleeing and Doubt in People with Obsessive-Compulsive Disorder

Introduction: Obsessive-compulsive disorder is a chronic anxiety disorder that characterized by excessive preoccupation about orderliness and minor disputes and doubts. The current study aimed to survey the effectiveness of cognitive analytic therapy on reducing guilt fleeing and doubt in people with obsessive-compulsive disorder. Methods and Materials: For this purpose, among the people with ...

متن کامل

Genome-wide Association Study to Identify Genes and Biological Pathways Associated with Type Traits in Cattle using Pathway Analysis

Extended Abstract Introduction and Objective: Type traits describing the skeletal characteristics of an animal are moderately to strongly genetically correlate with other economically important traits in cattle including fertility, longevity and carcass traits. The present study aimed to conduct a genome wide association studies (GWAS) based on gene-set enrichment analysis for identifying the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 6  شماره 

صفحات  -

تاریخ انتشار 2011